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Abstract 

The determination of the primary cosmic ray mass composition from 
the characteristics of extensive air showers (EAS), obtained at an ob- 
servation level in the lower half of the atmosphere, is still an open 
problem. In this work we propose a new method of Multiparametric 
Topological Analysis and show its applicability for the determination 
of the mass composition of the primary cosmic rays at the PeV energy 
region. 
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1 Introduction 



It is clear that due to large fluctuations in the longitudinal development of 
extensive air showers (EAS) and relatively weak sensitivity of EAS char- 
acteristics to the mass of the parent particle, the observed parameters of 
showers initiated by particles with different primary masses largely overlap. 
In order to estimate the primary mass in the case of the individual shower or 
to estimate the mean mass composition at a certain energy we have to use 
as many observables as possible. To process the big amount of information 
various multiparametric and non-parametric methods are employed. Among 
them are multivariate fitting, KNN-method, Bayesian approach, neural net 
analysis and others. In ^jy we proposed a new method of the multiparametric 
topological analysis (MTA) for the study of the mean mass composition on 
the basis of the EAS longitudinal development data. We tested this method 
using the data simulated in the EeV energy region for the Pierre Auger ex- 
periment and were encouraged by its accuracy and the easy applicability. 
In this contribution we test this method at the lower, PeV energy region 
close to the well known 'knee' in the primary cosmic ray energy spectrum 
(~3 PeV). Experimental arrays for the EAS study in this energy region are 
usually more compact and are able to measure the total size of electron N^, 
muon A^^, hadron Nf^ and Cherenkov light Npf^ components of the shower. 
The typical example of such a complex EAS array is K AS CADE |2]. Here 
we use only two parameters: A'^p and Nfj_, although the method is easily 
generalized for the larger number of observables. 



2 The simulated data 

In what follows, we assume that the primary energy estimate for the ob- 
served events is accurate at a few percent level. In fact, the primary energy 
estimator for KASCADE is Nj^' - the total number of low energy (> 0.23 
GeV) muons collected in the interval of distances between 40 and 200 m from 
the shower core. It has been shown that this number depends just on the 
primary energy and not on the primary mass. The accuracy of the primary 
energy estimate in the knee region made with this parameter is determined 



mainly by the Nj[ reconstruction error and is about 5% [3] 

The data set used to implement and test the new method consists of 
20000 vertical cascades produced by particles with the fixed energy of 0.5 
PeV, 8000 cascades for 1 PeV and 2000 cascades for 5 PeV, simulated using 
the CORSIKA program (version 6.023, ^) with the QGSJET interaction 
model. Simulations were performed using Naples PCs . The primary nuclei 
were P, He, O and Fe, each of them initiating 5000, 2000 and 500 cascades 
for 0.5, 1 and 5 PeV primary energy respectively. The CORSIKA output 
provides various characteristics of the shower: the total number of different 
particles, their lateral distribution and arrival times at different observation 
levels. 

3 Multiparametric Topological Analysis (MTA) 

The MTA method, when applied to the simulated data, relies on a topolog- 
ical analysis of correlations between the most significant parameters of the 
shower development in the atmosphere. In principle this method could be 
used also with a greater number of parameters, however, in this paper we 
restrict ourself to the simple case of two parameters only: logNg and logNf^ 
- the total number of electrons and muons at the sea level. A scatter plot of 
these two parameters has been built using the 4 x 4000 showers for 0.5 PeV, 

4 X 1000 for 1 PeV and 4 x 400 for 5 PeV primary energy. Figure ^ shows 
the scatter plot for only proton and iron induced showers of 1 PeV primary 
energy as well as their projected distributions. 

It can be seen that the populations arising from two nuclei are quite 
well separated in the logN^^ — logN^ plane, but not in the projected one- 
dimensional plots. The scatter plot of four primary nuclei: P, He, O and Fe 
shows much less separation even in the two-dimensional plot. The idea of 
the MTA method is to divide the entire area of the scatter plot into cells 
whose dimensions are defined by the accuracy with which the parameters 
can be measured. For this work, we adopted the accuracy and the size of the 
cell equal to A{logN^) = 0.05 and A{logNe) = 0.02, which corresponds to 
the experimental errors of KASCADE array 0. In each cell we can find the 
total number of showers N^^^, as the sum of Np, N^i^^, Nq and N^^ showers 
induced by P, He, O and Fe respectively, and then derive the associated 
frequencies: p},=N},/<,, pi^e=Nke/NLt, Po=%/^iot and p^=NipjNl^ 
which can be interpreted as the probability for a real shower falling into 
the i*'' cell to be initiated by proton, helium, oxygen or iron primary nuclei. 
In other words, in the case of an experimental data set of N^xp showers, 
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Figure 1: (a) logN^ vs. logNe scatter plot for proton (star) and iron (circle) induced 
showers at 1 PeV primary energy, b) Projected distributions of the scatter plot in (a) for 
protons (dashed line) and iron nuclei (full line) 



it may be seen as composed of a mixture of ^exp>< Pp proton showers, 
NexpX PHe helium showers, Ng^^pX po oxygen showers and Ne^pX ppe iron 
induced showers, where pA = '^iP\- As a probe set of 'experimental' showers 
4 subsets of showers, different from those used for the determination of 
probabilities pA, have been used. Each subset consists of 500 showers for 
0.5 PeV, 200 showers for 1 PeV and 50 showers for 5 PeV primary energy. 
For each individual shower in a given subset the partial probabilities pp , 
Phe ! Po PFe have been read from the relevant cell i. The sum of such 
probabilities over the entire subset permits to estimate the probability for 
a shower of a given nature to be identified as a shower generated by the P, 
He, O or Fe primary particle. This probability is shown in Figure at E = 
1 PeV. One can see that the method works quite well being able to attribute 
the highest probability to the correct nuclei. 



4 Determination of the primary mass composition 

The obtained mean probabilities Pij for cascades induced by type i nuclei 
to be identified as those induced by j nuclei for the pure primary mass 
composition can be used for the reconstruction of the mixed primary mass 
composition as the coefficients in the system of linear equations: 
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Figure 2: Application of the MTA method to the logN^ — logNe scatter plot - mean 
probability Pij for the cascades induced by i nucleus: P, He, O and Fe (specified in 
headers) being identified as induced by j nucleus, indicated at abscissa, at E = 1 PeV. 



n'p = np ■ Pp^p + HHe ■ Phc^P + no ■ Pq^P + "-Fe • Ppe^P 

n'He = np ■ Pp^He + nHe ' Pue-^He + no ■ Po^He + npe ' Ppe^He 

n'o = np ■ Pp^O + nHe ■ Ppe^O + nHe ■ Po-*0 + nHe ■ Ppe^O 

n-'pe = np ■ Pp^Fe + nHe ' Ppe^Pe + no ■ Po^Fe + npe ' Ppe^Pe 

where np, nne, no and npe are the true numbers, which determine the pri- 
mary mass composition in the sample of = np + nne + no + npe cascades, 
which are observed as n'p, n'^^, n^ and n'p^, due to a misclassification. 

In order to invert the problem and to reconstruct the abundances nAi 
in the primary mass composition from the observed abundances n^. and 
the known probabilities Pij, we can apply any method capable to solve the 



inverse problem taking into account possible errors of the observed distri- 
bution and the constraint: 

Stin^, = ^Un'^^ = N (1) 

The observed abundances were simulated using a set of 4x800 cascades 
different from those used for the determination of Pij. At the moment the 
result of the solution for the uniform primary mass composition at E = 1 
PeV, with all abundances pA = 7^ = 0.25, reconstructed by MTA method 
is pp = 0.26, pHe = 0.22, po = 0.27 and ppe = 0.25. 

As an example of the non-uniform mass composition we have taken the 
case with pp = ppe = 0.1, pHe = Po = 0.4, which resembles the mass 
composition of cosmic rays at the knee energy of ~3 PeV 0. The total 
amount of cascades used for the simulation of this distribution is 2000 (200 
P + 800 He+ 800 O +200 Fe). The result of the solution for this case is 
PP = 0.14, PHe = 0.34, PO = 0.42 and ppe = 0.10 at E = 1 PeV . 

It is seen that absolute values of the abundance are reconstructed satis- 
factorily. 

For the time being the values of the probabilities Pij are assumed to be 
known precisely. A more accurate solution of the system, taking propely 
into account the errors of the probability, is under study. 

5 Application for experimental cascades 

It has to be stressed that all the results outlined above are biased by the 
fact that we are dealing with simulated 'noiseless' data and more realistic 
testing has to be performed using data which take into account the varying 
primary energy and inclination angle, the instrumental signature (noise) 
and the various sources of errors (uneven and incomplete sampling, etc.). 
Application of MTA is straightforward - the relevant simulations should be 
made with the largest possible statistics. 

The preliminary study of the effect of the experimental errors shows 
rather strong dependence of the probabilities Pij on the adopted errors. 
In Figure |31 we show the probabilities Pa for the correct identification of 
nuclei as a function of the primary energy. The experimental accuracies 
of the shower size determination have been taken as A{logNe) = 0.02 and 
A{logNfj_) = 0.05. It is seen that while the energy dependence is week the 
reduction of probability Pa is substantial. 
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Figure 3: The probability of the correct identification of the primary nucleus Pa as 
the function of its energy without (full circles) and with (open circles) an account of 
experimental errors: A{logNe) = 0.02 and A{logNfi) = 0.05. The primary nuclei are: 
(a)-P ; (b)-He ; (c)-0 ; (d)-Fe. 



6 Conclusions 

We proposed and tested the Multiparametric Topological Analysis method 
for the determination of the mean primary cosmic ray mass composition 
on the basis of measurements of just two parameters of the observed EAS 
- the total number of electrons and muons. Definitely the method needs 
further development before being used for the processing of real experimental 
cascades, but the first results are very encouraging. The next efforts should 
include the increase of simulation statistics, the study of the effect of the 
shower energy spectrum and angular distribution, the accurate estimate 
of the reconstruction errors. The great advantage of the proposed MTA 
method is that it is extremely easy to use and generalize for the larger 
number of observables. 
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